An Approach to Selecting Putative RNA Motifs Using MDL Principle
Abstract
The history of molecular biology is punctuated by a series of discoveries demonstrating the surprising breadth of biological roles of ribonucleic acid (RNA). An ensemble of evolutionarily related RNA sequences, believed to contain signals at the sequence and structure levels, can be exploited to detect motifs common to all or a portion of those sequences. Finding these shared structural features can provide substantial information about which parts of the sequence are functional. For several decades, free energy minimization has been the most popular method for structure prediction. However, limitations of the free energy models, as well as their time complexity, have prompted us to look for alternative approaches. We therefore investigate another paradigm, minimum description length (MDL) encoding, for evaluating the significance of consensus motifs. Here, we evaluate motifs generated by Seed using the description length as the selection criterion. The MDL scoring method was tested on four data sets of varying complexity. We found that the scoring method produces structures that are competitive with those predicted with the lowest free energy. The top-ranked motifs have high positive predictive value with respect to known motifs.
Similar resources
A New Hybrid Decision Making Method for Selecting Roller Concrete Road Pavement Technology Transfer Method
In today's competitive market, technology transfer is an important problem for firms, organizations and governments. Therefore, making the right decisions when selecting a suitable technology and designing an appropriate process to transfer it may have a significant influence on the performance of organizations. In this paper, we present a new method to obtain a suitable technology transfer strategy fo...
Evaluation of the haplotype motif model using the principle of minimum description
We apply minimum description length (MDL) principles to evaluate the merit of relaxing the rigidity of block models of haplotype structure. We accomplish this by developing an MDL formulation of the more general "haplotype motif" haplotype structure similar to an approach proposed independently by Koivisto et al. [K+04]. Comparison of equivalent block and motif MDL models on real and simulated ...
Minimum description length (MDL) regularization for online learning
An approach inspired by the Minimum Description Length (MDL) principle is proposed for adaptively selecting features during online learning based on their usefulness in improving the objective. The approach eliminates noisy or useless features from the optimization process, leading to improved loss. Several algorithmic variations on the approach are presented. They are based on using a Bayesian...
Stochastic complexity and model selection from incomplete data
The principle of minimum description length (MDL) provides an approach for selecting the model class with the smallest stochastic complexity of the data among a set of model classes. However, when only incomplete data are available the stochastic complexity for the complete data cannot be numerically computed. In this paper, this problem is solved by introducing a notion of expected stochastic ...
The MDL model choice for linear regression
In this talk, we discuss the principle of Minimum Description Length (MDL) for problems of statistical modeling. By viewing models as a means of providing statistical descriptions of observed data, the comparison between competing models is based on the stochastic complexity (SC) of each description. The Normalized Maximum Likelihood (NML) form of the SC (Rissanen 1996) contains a component tha...
Publication date: 2006